Topic 21: Linear Regression with One Regressor 


Test ID: 8450958 


Question #1 of 37 Question ID: 438933 

Which of the following statements regarding the coefficient of determination is least accurate? The coefficient of determination: 

/ A) may range from -1 to +1. 

X B) is the percentage of the total variation in the dependent variable that is explained by the independent 
variable. 

X C) cannot decrease as independent variables are added to the model. 

X D) is the ratio of explained variation to total variation. 

Explanation 

In a simple regression, the coefficient of determination is calculated as the correlation coefficient squared and ranges from 0 to 
+ 1 . 


Question #2 of 37 Question ID: 438929 

Which of the following statements about linear regression analysis is most accurate? 

X A) The coefficient of determination is defined as the strength of the linear relationship between 
two variables. 

X B) A perfectly negative correlation can be depicted by a correlation coefficient of +1 . 

7 C) An assumption of linear regression is that the residuals are independently distributed. 

X D) When there is a strong relationship between two variables we can conclude that a change in one will 
cause a change in the other. 

Explanation 

Even when there is a strong relationship between two variables, we cannot conclude that a causal relationship exists. The 
coefficient of determination is defined as the percentage of total variation in the dependent variable explained by the independent 
variable. 


Question #3 of 37 Question ID: 438934 

A simple linear regression equation had a coefficient of determination (R 2 ) of 0.8. What is the correlation coefficient between the 
dependent and independent variables and what is the covariance between the two variables if the variance of the independent 
variable is 4 and the variance of the dependent variable is 9? 


Correlation 

coefficient Covariance 

X A) 0.91 4.80 
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X B) 0.89 


4.80 


X C) 

0.91 

5.34 

y d) 

0.89 

5.34 


Explanation 

The correlation coefficient is the square root of the R 2 , r = 0.89. 

To calculate the covariance multiply the correlation coefficient by the product of the standard deviations of the two variables: 
COV = 0.89 x V4 x V9 = 5.34 


Question #4 of 37 Question ID: 438924 

Consider the regression results from the regression of Y against X for 50 observations: 

Y = 5.0 - 1.5 X 


The standard error of the estimate is 0.40 and the standard error of the coefficient is 0.45. The predicted value of Y if X is 10 is: 


/ A) -10. 

X B) 10. 
X C) 4.5. 
X D) 20. 


Explanation 

The predicted value of Y is: Y = 5.0 - [1 .5 (10)] = 5.0 - 15 = -10 


Question #5 of 37 Question ID: 438936 

Unlike the coefficient of determination, the coefficient of correlation: 

7 A) indicates whether the slope of the regression line is positive or negative. 

X B) indicates the percentage of variation explained by a regression model. 

X C) measures the strength of association between the two variables more exactly. 

X D) can have an absolute value greater than 1 . 

Explanation 

In a simple linear regression the coefficient of determination (R 2 ) is the squared correlation coefficient, so it is positive even 
when the correlation is negative. 


Question #6 of 37 Question ID: 438915 

The purpose of regression is to: 
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X A) explain the variation in the independent variable. 

X B) get the largest R 2 possible. 

X C) explain the mean of the independent variable. 

7 D) explain the variation in the dependent variable. 

Explanation 

The goal of a regression is to explain the variation in the dependent variable. 


Question #7 of 37 Question ID: 438910 

In a regression analysis, the effects from independent variables that are not included in the model are embodied in the: 

y A) error term. 

X B) slope coefficient. 

X C) intercept. 

X D) scattergram. 

Explanation 

The error term embodies the effect of variables omitted from the model. 


Question #8 of 37 Question ID: 438931 

Linear regression is based on a number of assumptions. Which of the following is least likely an assumption of linear regression? 

X A) A linear relationship exists between the dependent and independent variables. 

y B) There is at least some correlation between the error terms from one observation to the next. 

X C) Values of the independent variable are not correlated with the error term. 

X D) The variance of the error terms each period remains the same. 

Explanation 

When correlation exists, autocorrelation is present. As a result, residual terms are not normally distributed. This is inconsistent 
with linear regression. 


Question #9 of 37 Question ID: 438921 

A simple linear regression is run to quantify the relationship between the return on the common stocks of medium sized 
companies (Mid Caps) and the return on the S&P 500 Index, using the monthly return on Mid Cap stocks as the dependent 
variable and the monthly return on the S&P 500 as the independent variable. The results of the regression are shown below: 



Coefficient 

Standard Error of Coefficient 

t-Value 

Intercept 

1.71 

2.950 

0.58 

S&P 500 

1.52 

0.130 

11.69 

R 2 = 0.599 
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Use the regression statistics presented above and assume this historical relationship still holds in the future period. If the 
expected return on the S&P 500 over the next period were 11%, the expected return on Mid Cap stocks over the next period 
would be: 

/ A) 18.4%. 

X B) 33.8%. 

X C) 20.3%. 

X D) 25.6%. 

Explanation 

Y = intercept + slope(X) 

Mid Cap Stock returns = 1.71 + 1.52(11) =18.4% 


Question #1 0 of 37 Question ID: 438938 

An analyst performs two simple regressions. The first regression analysis has an R-squared of 0.40 and a beta coefficient of 1 .2. The 
second regression analysis has an R-squared of 0.77 and a beta coefficient of 1 .75. Which one of the following statements is most 
accurate? 

7 A) The second regression equation has more explaining power than the first regression equation. 

X B) The R-squared of the first regression indicates that there is a 0.40 correlation between the independent 
and the dependent variables. 

X C) The first regression equation has more explaining power than the second regression equation. 

X D) The beta coefficient of the 2nd regression indicates that this regression has more explaining power than 
the first. 

Explanation 

The coefficient of determination (R-squared) is the percentage of variation in the dependent variable explained by the variation in the 
independent variable. The larger R-squared (0.77) of the second regression means that 77% of the variability in the dependent variable is 
explained by variability in the independent variable, while only 40% of that is explained in the first regression. This means that the second 
regression has more explaining power than the first regression. Note that the Beta is the slope of the regression line and doesn't 
measure explaining power. 


Question #11 of 37 Question ID: 438913 

In the scatter plot below, the correlation between the return on stock A and the market index is: 
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X A) zero. 

X B) not discernable using the scatter plot. 

/ C) positive. 

X D) negative. 

Explanation 

In the scatter plot, higher values of the return on stock A are associated with higher values of the return on the market, i.e. a 
positive correlation between the two variables. 


Question #12 of 37 Question ID: 438911 

Which of the following statements regarding scatter plots is most accurate? Scatter plots: 

X A) illustrate the scatterings of a single variable. 

X B) are used to examine the third moment of a distribution (skewness). 

/ C) illustrate the relationship between two variables. 

X D) are used to examine the fourth moment of a distribution (kurtosis). 

Explanation 

A scatter plot is a collection of points on a graph where each point represents the values of two variables. They are used to 
examine the relationship between two variables. 


Question #13 of 37 Question ID: 424959 

Which of the following is least likely an assumption of linear regression? The: 

X A) expected value of the residuals is zero. 

V B) residuals are mean reverting; that is, they tend towards zero over time. 

X C) residuals are independently distributed. 

X D) variance of the residuals is constant. 

Explanation 

The assumptions regarding the residuals are that the residuals have a constant variance, have a mean of zero, and are 
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independently distributed. 


Question #14 of 37 Question ID: 438940 

Assume an analyst performs two simple regressions. The first regression analysis has an R-squared of 0.90 and a slope 
coefficient of 0.10. The second regression analysis has an R-squared of 0.70 and a slope coefficient of 0.25. Which one of the 
following statements is most accurate? 

X A) The influence on the dependent variable of a one unit increase in the independent variable is 
0.9 in the first analysis and 0.7 in the second analysis. 

X B) Results of the second analysis are more reliable than the first analysis. 

X C) The influence on the dependent variable of a one unit increase in the independent variable is 0.7 in 
the first analysis and 0.9 in the second analysis. 

/ D) The first regression has more explanatory power than the second regression. 

Explanation 

The coefficient of determination (R-squared) is the percentage of variation in the dependent variable explained by the variation in 
the independent variable. The larger R-squared (0.90) of the first regression means that 90 percent of the variability in the 
dependent variable is explained by variability in the independent variable, while 70 percent of that is explained in the second 
regression. This means that the first regression has more explanatory power than the second regression. Note that the Beta is 
the slope of the regression line and doesn't measure explanatory power. 


Question #1 5 of 37 Question ID: 424957 

Which of the following is least likely an assumption of a simple regression? 

7 A) The variance of the error term is one. 

X B) The expected value of the error term is zero. 

X C) There is a linear relationship between dependent and independent variables. 

X D) The error term is normally distributed. 

Explanation 

There is no requirement that the variance of the error term should be equal to one. 


Question #16 of 37 Question ID: 424960 

Which of the following is least likely an assumption of linear regression? 

X A) The variance of the residuals is constant. 

X B) There is a linear relation between the dependent and independent variables, 
y C) The independent variable is correlated with the residuals. 

X D) The residuals are normally distributed. 

Explanation 
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The assumption is that the independent variable is uncorrelated with the residuals. 


Question #17 of 37 Question ID: 438916 

The capital asset pricing model is given by: R, =Rf + Beta ( R m -Rf) where R m = expected return on the market, Rf = risk-free market and 
Rj = expected return on a specific firm. The dependent variable in this model is: 

V A) Rj. 

X B) R m . 

X C) R f . 

X D) R m - R f . 

Explanation 

The dependent variable is the variable whose variation is explained by the other variables. Here, the variation in r, is explained by the 
variation in the other variables, r f and r m . 


Question #1 8 of 37 Question ID: 438912 

If the correlation between two variables is -1.0, the scatter plot would appear along a: 

X A) a curved line running from southwest to northeast. 

X B) straight line running from southwest to northeast, 
y C) straight line running from northwest to southeast. 

X D) a curved line centered in the scatter plot. 

Explanation 

If the correlation is -1.0, then higher values of the y-variable will be associated with lower values of the x-variable. The points would lie 
on a straight line running from northwest to southeast. 


Question #19 of 37 Question ID: 438932 

Assume you perform two simple regressions. The first regression analysis has an R-squared of 0.80 and a beta coefficient of 0.10. The 
second regression analysis has an R-squared of 0.80 and a beta coefficient of 0.25. Which one of the following statements is most 
accurate? 

X A) Results of the second analysis are more reliable than the first analysis. 

X B) Results from the first analysis are more reliable than the second analysis. 

X C) The influence on the dependent variable of a one-unit increase in the independent variable is the same in 
both analyses. 

y D) Results from both analyses are equally reliable. 

Explanation 

The coefficient of determination (R-squared) is the percentage of variation in the dependent variable explained by the variation in the 
independent variable. The R-squared (0.80) being identical between the first and second regressions means that 80 percent of the 
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variability in the dependent variable is explained by variability in the independent variable for both regressions. This means that the first 
regression has the same explaining power as the second regression. 


Question #20 of 37 Question ID: 438914 

Joe Harris is interested in why the returns on equity differ from one company to another. He chose several company-specific variables to 
explain the return on equity, including financial leverage and capital expenditures. In his model: 

X A) return on equity is the explanatory variable, and financial leverage and capital expenditure are the 
explained variables. 

X B) return on equity, financial leverage, and capital expenditures are all independent variables. 

■J C) return on equity is the dependent variable, and financial leverage and capital expenditures are independent 
variables. 

X D) return on equity is the independent variable, and financial leverage and capital expenditures are dependent 
variables. 

Explanation 

The dependent variable is return on equity. This is what he wants to explain. The variables he uses to do the explaining (i.e. , the 
independent variables) are financial leverage and capital expenditures. 


Question #21 of 37 Question ID: 438918 

The independent variable in a regression equation is called all of the following EXCEPT : 

X A) predicting variable. 

X B) explanatory variable. 

X C) exogenous variable. 

■J D) predicted variable. 

Explanation 

The dependent variable is the predicted variable. 


Question #22 of 37 Question ID: 438935 

Which term is least likely to apply to a regression model? 

X A) Coefficient of determination. 

X B) Goodness of fit. 

X C) R 2 . 

•/ D) Coefficient of variation. 

Explanation 

Goodness of fit, coefficient of determination and R 2 are different names for the same concept. The coefficient of variation is not directly 
part of a regression model. 
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Question #23 of 37 Question ID: 438927 

Which of the following are included in a sample regression function? 

I. The intercept. 

II. The error term. 

III. The slope coefficient. 

IV. The independent variable. 

X A) I, II, III, and IV. 

X B) III and IV only. 

X C) I and III only. 

■J D) 1,111, and IV only. 

Explanation 

The sample regression function has a residual and not an error term. Although the residual and error term serve a similar purpose in their 
respective equations, there are distinctions. 


Question #24 of 37 Question ID: 438917 

A regression analysis has the goal of: 

X A) measuring how the properties of the variables regress towards each other. 

/ B) estimating how changes in independent variables affect a dependent variable. 

X C) estimating how changes in dependent variable affect an independent variable. 

X D) measuring the tendency of both independent and dependent variables to regress towards their respective 
means. 

Explanation 

A regression analysis has the goal of measuring how changes in one variable, called a dependent or explained variable, can be explained 
by changes in one or more other variables called the independent or explanatory variables. 


Question #25 of 37 Question ID: 424958 

Which of the following is least likely an assumption of linear regression analysis? 

X A) The Y values are all less than 3 standard deviations from the regression line. 

X B) The error term is normally distributed. 

X C) The expected value of the residuals is zero. 

X D) The X values are uncorrelated with the error terms. 

Explanation 

In a normal distribution, approximately 99%, but not necessarily all, of the observations are within 3 standard deviations of the regression 
line. 
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Question #26 of 37 Question ID: 438939 

A simple linear regression is run to quantify the relationship between the return on the common stocks of medium sized companies (Mid 
Caps) and the return on the S&P 500 Index, using the monthly return on Mid Cap stocks as the dependent variable and the monthly 
return on the S&P 500 as the independent variable. The results of the regression are shown below: 




Standard Error 



Coefficient 

of coefficient 

t- Value 

Intercept 

1.71 

2.950 

0.58 

S&P 500 

1.52 

0.130 

11.69 


R 2 = 0.599 


The strength of the relationship, as measured by the correlation coefficient, between the return on Mid Cap stocks and the return on the 
S&P 500 for the period under study was: 


X A) 2.950. 

/ B) 0.774. 

X C) 0.599. 

X D) 0.130. 

Explanation 

You are given R 2 or the coefficient of determination of 0.599 and are asked to find R or the coefficient of correlation. The square root of 
0.599 = 0.774. 


Question #27 of 37 Question ID: 438941 

Sample regression coefficients are often estimated with a process known as: 

X A) a population regression function. 

X B) a scattergram. 

V C) ordinary least squares. 

X D) Ockham's razor. 

Explanation 

Ordinary least squares or OLS is a method for estimating population coefficients. It provides the line that minimizes the squared 
deviations of the dependent variables in the sample from the sample regression line. 


Question #28 of 37 Question ID: 438937 

What does the R 2 of a simple regression of two variables measure and what calculation is used to equate the correlation coefficient to 
the coefficient of determination? 

R2 measures: Correlation coefficient 
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X A) percent of variability of the 
independent variable that is 

R 2 = r 2 

explained by the variability of 
the dependent variable 

X B) percent of variability of the 
independent variable that is 

R 2 = r x 2 

explained by the variability of the 
dependent variable 

X C) percent of variability of the 
dependent variable that is 

R 2 = |-2 

explained by the variability of the 
independent variable 

X D) percent of variability of the 
dependent variable that is 

R 2 = r x 2 

explained by the variability of the 
independent variable 

Explanation 

R 2 , or the Coefficient of Determination, is the square of the coefficient of correlation (r). The coefficient of correlation describes the 
strength of the relationship between the X and Y variables. The standard error of the residuals is the standard deviation of the dispersion 
about the regression line. The t-statistic measures the statistical significance of the coefficients of the regression equation. In the 
response: "percent of variability of the independent variable that is explained by the variability of the dependent variable," the definitions 
of the variables are reversed. 


Question #29 of 37 Question ID: 438920 

An analyst is examining the relationship between two random variables, RCRANTZ and GSTERN. He performs a linear regression that 
produces an estimate of the relationship: 

RCRANTZ = 61.4 - 5.9GSTERN 
Which interpretation of this regression equation is least accurate? 

X A) The covariance of RCRANTZ and GSTERN is negative. 

X B) In this regression, RCRANTZ is the dependent variable and GSTERN is the independent variable. 

X C) If GSTERN increases by one unit, RCRANTZ should increase by 5.9 units. 

X D) The intercept term implies that if GSTERN is zero, RCRANTZ is 61 .4. 

Explanation 

The slope coefficient in this regression is -5.9. This means a one unit increase of GSTERN suggests a decrease of 5.9 units of 
RCRANTZ. The slope coefficient is the covariance divided by the variance of the independent variable. Since variance (a squared term) 
must be positive, a negative slope term implies that the covariance is negative. 
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Question #30 of 37 


Question ID: 438922 


Given: Y = 2.83 + 1.5X 

What is the predicted value of the dependent variable when the value of an independent variable equals 2? 

X A) 2.83 

X B) 5.83 
X C) 6.50 
X D) -0.55 

Explanation 

Y = 2.83 + (1.5)(2) 

= 2.83 + 3 
= 5.83 


Question #31 of 37 Question ID: 438926 

In the estimated regression equation Y = 0.78 - 1 .5 X, which of the following is least accurate when interpreting the slope coefficient? 

X A) The dependent variable declines by -1.5 units if X increases by 1 unit. 

X B) The dependent variable increases by 1 .5 units if X decreases by 1 unit. 

X C) -1 .5 is the elasticity of Y with respect to X. 

Y D) If the value of X is zero, the value of Y will be -1.5. 

Explanation 

The slope represents the change in the dependent variable for a one-unit change in the independent variable. If the value of X is zero, the 
value of Y will be equal to the intercept, in this case, 0.78. 


Question #32 of 37 


Question ID: 438942 


As part of a regression analysis, an analyst finds that: Y - bi * X = -1 .8 and b 1 = 3.2. Based upon these results, for every unit increase in 
the independent variable, on average the dependent variable increases by: 

X A) 5.0. 

y B) 3.2. 

X C) 1.8. 

X D) 1.4. 


Explanation 

bi gives the slope coefficient which indicates the average change in the dependent variable given a unit change in the independent 
variable. 


Question #33 of 37 


Question ID: 438928 
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Trudy Baker, FRM and Steven Phillips, FRM are planning to do a regression analysis. They discuss specifying the equation they wish to 
estimate. Baker proposes the specification E(Yj|Xj) = Bo + (B-i) x (Xj 2 ). Phillips proposes the specification (Yi|Xi) = Bo + (B-i x Xj) 2 . 
Which, if either, is appropriate when applying linear regression? 

■J A) The specification of Baker but not that of Phillips. 

X B) Both the specification of Baker and Phillips. 

X C) The specification of Phillips but not that of Baker. 

X D) Neither the specification of Baker nor that of Phillips. 

Explanation 

Since the specification of Phillips would essentially be Bo + (Bi 2 ) x (Xj 2 ), this precludes the application of linear regression analysis 
because there is an exponent on B-i, i.e. , the specification is nonlinear with respect to the parameters. When we refer to a linear 
regression model we generally assume that the equation is linear in the parameters; it may or may not be linear in the variables. 


Question #34 of 37 Question ID: 438923 

Paul Frank is an analyst for the retail industry. Fie is examining the role of television viewing by teenagers on the sales of accessory 
stores. He gathered data and estimated the following regression of sales (in millions of dollars) on the number of hours watched by 
teenagers (TV, in hours per week): 

Sales t =1.05+1.6 TV t 

The predicted sales if television watching is 5 hours per week is: 

X A) $8.00 million. 

■J B) $9.05 million. 

X C) $2.65 million. 

X D) $1.05 million. 

Explanation 

The predicted sales are: Sales = $1.05 + [$1.6 (5)] = $1.05 + $8.00 = $9.05 million. 


Question #35 of 37 Question ID: 438925 

Sera Smith, a research analyst, had a hunch that there was a relationship between the percentage change in a firm's number of 
salespeople and the percentage change in the firm's sales during the following period. Smith ran a regression analysis on a sample of 50 
firms, which resulted in a slope of 0.72, an intercept of +0.01 , and an R 2 value of 0.65. Based on this analysis, if a firm made no changes 
in the number of sales people, what percentage change in the firm's sales during the following period does the regression model predict? 

X A) +0.65%. 

V B) +1.00%. 

X C) +0.72%. 

X D) +0.10%. 

Explanation 
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The slope of the regression represents the linear relationship between the independent variable (the percent change in sales people) and 
the dependent variable, while the intercept represents the predicted value of the dependent variable if the independent variable is equal 
to zero. In this case, the percentage change in sales is equal to: 0.72(0) + 0.01 = +0.01. 


Question #36 of 37 Question ID: 438930 

The assumptions underlying linear regression include all of the following EXCEPT the: 

■J A) independent variable is linearly related to the residuals (or disturbance term). 

X B) disturbance term is normally distributed with an expected value of 0. 

X C) dependent variable and independent variable are linearly related. 

X D) disturbance term is homoskedastic and is independently distributed. 

Explanation 

The independent variable is uncorretated with the residuals (or disturbance term). 

The other statements are true. The disturbance term is homoskedastic because it has a constant variance. It is independently distributed 
because the residual for one observation is not correlated with that of another observation. Note: The opposite of homoskedastic is 
heteroskedastic. For the examination, memorize the assumptions underlying linear regression! 


Question #37 of 37 Question ID: 438919 

Paul Frank is an analyst for the retail industry. He is examining the role of television viewing by teenagers on the sales of accessory 
stores. He gathered data and estimated the following regression of sales (in millions of dollars) on the number of hours watched by 
teenagers (in hours per week): 

Sales t =1.05+1.6 TV t 

Which of the following is the most accurate interpretation of the estimated results? If TV watching: 

X A) goes up by one hour per week, sales of accessories increase by $1.60. 

X B) is zero (that is, every teenager turns off the TV for a week), the expected sales of accessories is $0. 

V C) goes up by one hour per week, sales of accessories increase by $1 .6 million. 

X D) changes, no change in sales is expected. 

Explanation 

The interpretation of the slope coefficient is the change in the dependent variable (sales in millions of dollars) for a given one-unit change 
in the independent variable (TV hours per week). The intercept of 1 .05 means that 1 .05 million dollars worth of accessories is expected 
to be sold even if TV watching is zero. 
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